Adjusting Occurrence Probabilities of Automatically-Generated Abbreviated Words in Spoken Dialogue Systems
Abstract
Users often abbreviate long words when using spoken dialogue systems, which results in automatic speech recognition (ASR) errors. We define abbreviated words as sub-words of an original word and add them to the ASR dictionary. Two problems arise. First, long and compound words need to be segmented in agglutinative languages such as Japanese, but general morphological analyzers cannot segment proper nouns correctly. Second, adding many abbreviated words increases the vocabulary size, which degrades ASR accuracy. We have developed two methods: (1) segmenting words by using conjunction probabilities between characters, and (2) adjusting the occurrence probabilities of the generated abbreviated words on the basis of two cues: phonological similarity between the abbreviated and original words, and the frequency of the abbreviated words in Web documents. Our method improves ASR accuracy by 34.9 points for utterances containing abbreviated words without degrading the accuracy for utterances containing original words.
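The two methods can be illustrated with a minimal Python sketch. The abstract gives no formulas, so everything below is an assumption made for illustration: the function names, the `threshold` and `mass` parameters, the choice of contiguous segment runs as sub-words, the product of the two cues as a score, and the use of `difflib.SequenceMatcher` on character strings as a rough stand-in for phonological similarity. It is not the authors' implementation.

```python
from difflib import SequenceMatcher


def segment(word, conj_prob, threshold=0.5):
    """Split `word` between adjacent characters whose conjunction
    probability is below `threshold` (hypothetical decision rule)."""
    if not word:
        return []
    segments, current = [], word[0]
    for prev, nxt in zip(word, word[1:]):
        if conj_prob.get((prev, nxt), 0.0) < threshold:
            segments.append(current)  # weak link: close the segment
            current = nxt
        else:
            current += nxt            # strong link: keep characters together
    segments.append(current)
    return segments


def abbreviation_candidates(segments):
    """Treat every contiguous run of segments, except the full word,
    as a candidate abbreviated word (an assumed definition of sub-word)."""
    n = len(segments)
    return sorted({
        "".join(segments[i:j])
        for i in range(n)
        for j in range(i + 1, n + 1)
        if j - i < n
    })


def adjusted_probabilities(original, candidates, web_freq, mass=0.1):
    """Share a fixed probability `mass` among candidates in proportion to
    (string similarity to the original) * (Web frequency).
    Both the product and the fixed mass are illustrative assumptions."""
    scores = {
        c: SequenceMatcher(None, c, original).ratio() * web_freq.get(c, 0)
        for c in candidates
    }
    total = sum(scores.values())
    if total == 0:
        return {c: 0.0 for c in candidates}
    return {c: mass * s / total for c, s in scores.items()}


# Toy data: conjunction probabilities and Web counts are made up.
conj_prob = {("a", "b"): 0.9, ("b", "c"): 0.1, ("c", "d"): 0.8,
             ("d", "e"): 0.2, ("e", "f"): 0.9}
segments = segment("abcdef", conj_prob)
candidates = abbreviation_candidates(segments)
probs = adjusted_probabilities("abcdef", candidates, {"ab": 10, "abcd": 30})
```

In this sketch, candidates that never appear in the Web counts receive zero probability, so adding them would not compete with the original words in the dictionary. A real system would compare phoneme sequences rather than characters.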
Similar resources
Expanding vocabulary for recognizing user's abbreviations of proper nouns without increasing ASR error rates in spoken dialogue systems
Users often abbreviate long words when using spoken dialogue systems, which results in automatic speech recognition (ASR) errors. We define abbreviated words as sub-words of the original word, and add them into an ASR dictionary. The first problem is that proper nouns cannot be correctly segmented by general morphological analyzers, although long and compounded words need to be segmented in agg...
Adaptive Spoken Dialogue Systems
Adaptive systems cover a broad range of interactive systems which adjust to new tasks, situations, users or expressions. These systems identify and classify relevant features to develop over time, adjusting their behaviour to different users and situations. The topic of this paper is adaptation in spoken dialogue systems based on features in the dialogue. These systems automatically extract dia...
Viability of a Simple Dialogue Act Scheme for a Tactical Questioning Dialogue System
User utterances in a spoken dialogue system for tactical questioning simulation were matched to a set of dialogue acts generated automatically from a representation of facts as 〈object, attribute, value〉 triples and actions as 〈character, action〉 pairs. The representation currently covers about 50% of user utterances, and we show that a few extensions can increase coverage to 80% or more. This ...
Automatically predicting dialogue structure using prosodic features
Spoken dialogue systems need to track dialogue structure in order to conduct sensible conversations. In previous work, we used only a shallow analysis of past dialogue in predicting the current dialogue act. Here we show that a hierarchical analysis of dialogue structure can significantly improve dialogue act recognition. Our approach is to integrate dialogue act recognition with speech recogni...
Towards Natural Clarification Questions in Dialogue Systems
Clarifications are often necessary for maintaining human-human as well as human-machine dialogue. However, clarification questions asked by Spoken Dialogue Systems (SDS) are very different from clarification questions asked in natural human interaction. While in human-human dialogues, speakers ask targeted questions using contextual information, SDS ask generic clarifications such as please rep...